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BACKGROUND OF THE INVENTION 

Field of the Invention 

[0001] The present invention relates to computer-implemented methods and 

systems of determining the affinity between polypeptide amino acid residues 
and one or more molecular fragments. The invention further provides 
methods and systems of using the affinity values to aid in drug design. 

Related Art 

[0002] The action of a particular drug is believed to be due to the interaction 

of that drug with a particular molecular target, such as a protein, nucleic acid, 
or other molecule found in the biological system. Typical protein drug targets 
include enzymes and receptors. Thomas, G., "Medicinal Chemistry - An 
Introduction" (John Wiley & Sons, Ltd., New York, 2001). The binding of the 
drug to the active or other sites (allosteric sites) of an enzyme usually has the 
effect of preventing the normal operation of that enzyme. Similarly, drugs act 
on receptors by binding to or near to a specific receptor that may either 
activate the receptor or prevent the binding of the receptor's normal substrate 
to that receptor. Ultimately, both of these actions can result in a physiological 
response that may have a therapeutic effect. The drug's effectiveness will 
depend on the stability of the drug-enzyme or drug-receptor complex and the 
number of sites occupied by the drug. Other targets for drug action include 
nucleic acids and other naturally occurring molecules. Id. 
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[0003] To rationally develop a drug lead, therefore, it is desirable to know the 

binding site on the target molecule (e.g., enzyme, receptor or nucleic acid). 
One approach to determining protein binding sites is protein mapping, where 
molecular probes, such as small organic molecules or functional groups are 
placed around the protein surface to determine the most favorable binding 
positions (Dennis et aL, PNAS 99:4290-4295 (2002)). Experimental 
approaches to protein mapping include x-ray crystallography and NMR 
methods. Id. Both of these approaches have shown that probes, even those 
generally unrelated to any natural substrate of the protein, bind to only a 
limited number of positions on the protein. Generally, a pocket of the active 
site tends to form a consensus site that binds many ligands, regardless of their 
sizes and polarities. Id. 

[0004] Because of the experimental difficulties associated with co- 

crystallizing proteins and probes, or the use of NMR to determine binding 
sites, a number of methods have been developed to. perform mapping 
computationally rather than experimentally (e.g, the drug design program 
GRID (Goodford, P.J., J. Med. Chem. 25:849-875 (1985) and the multiple 
copy simultaneous search (MCSS) strategy (Miranker, A. & Karplus, M., 
Proteins Struct. Funct. Genet. 77:29-34 (1991)). 

[0005] The major problem with these approaches, however, is that they result 

in many energy minima along the surface of the protein, making it difficult to 
determine which of the minima is actually relevant (Dennis et aL, PNAS 
99:4290-4295 (2002)). 

[0006] In one approach, a mapping algorithm was developed that uses an 

empirical free energy function to determine the active sites of egg white 
lysozyme and thermolysin. Id. This study reported that the sites identified by 
computational mapping agreed with those identified experimentally. 

[0007] The MCSS tool, discussed above, uses a fragment-based computational 

approach to identify binding sites and, as discussed above, is essentially based 
on an energy minimization approach, providing fragment states corresponding 
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to various local minima of the fragment-protein interaction potential energy 
field. 

[0008] Improved computational methods are necessary to provide accurate, 

quantitative estimates of the free energy of binding of molecular fragments to 
potential binding sites so that ligands can be designed for these sites. Such a 
computational method should yield values which can be used to determine the 
affinity of particular fragment-residue pairs, so that the results of particular 
simulations can be compared, the degree of convergence of a particular 
simulation can be determined and binding sites and key fragments can be 
identified. 

SUMMARY OF THE INVENTION 

[0009] Recognizing the tremendous need for accurate determinations of the 

interaction energies between fragments and the amino acid residues of 
polypeptide molecules, the present inventors have developed methods and 
systems of determining the affinity values of fragment-residue pairs. 

[0010] Accordingly, the present invention provides methods and systems of 

determining the affinity between polypeptide amino acid residues and one or 
more molecular fragments. The invention includes conducting a computer 
simulation of (i) a polypeptide and (ii) at least one molecular fragment, 
wherein at least one interaction energy is calculated between the polypeptide 
and at least one molecular fragment, wherein each calculated interaction 
energy is associated with a position of said at least one molecular fragment; 
and assigning an affinity value to at least one fragment and residue pair when 
said fragment is in the vicinity of the residue, wherein said affinity value is a 
measure of the free energy of interaction between the polypeptide and the 
fragment; wherein the above calculations are conducted for each molecular 
fragment present in the computer simulation. 

[0011] The present invention further provides methods and systems of using 

the affinity values of the present invention to, for example, determine the 
degree of convergence of a particular simulation, compare the results of 
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multiple computer-implemented simulations, identify protein binding sites, 
and help determine the key fragments to use in constructing ligands for a 
given polypeptide. 

DETAILED DESCRIPTION OF THE INVENTION 

[0012] Terms are used herein as generally used in the art, unless otherwise 

defined herein. 

[0013] In one aspect, the present invention provides methods and systems of 

determining the affinity between polypeptide amino acid residues and one or 
more molecular fragments. In one embodiment, the present invention includes 
conducting a computer simulation of (i) a polypeptide, and (ii) at least one 
molecular fragment, wherein at least one interaction energy is calculated 
between said polypeptide and said at least one molecular fragment, wherein 
each calculated interaction energy is associated with a position of said at least 
one molecular fragment; and assigning an affinity value to at least one 
fragment and residue pair when said fragment is in the vicinity of the residue, 
wherein said affinity value is a measure of the free energy of interaction 
between the polypeptide and the fragment; wherein the interaction energy and 
the affinity value is determined for each molecular fragment present in the 
computer simulation. 

[0014] As used herein, the term "polypeptide" encompasses a molecule 

comprised of amino acid molecules linked by peptide bonds, and includes all 
such molecules, regardless of the number of amino acids in the molecule. The 
term polypeptide, as used herein, also includes molecules which include other 
moieties in addition to amino acids, such as glycosylated polypeptides, e.g., 
antibodies. The term polypeptide, as used herein, also includes protein 
molecules which consist of more than one chain of amino acids linked by 
peptide bonds; the multiple chains may be covalently bonded to each other by 
means of disulfide sidechain bonds. 

[0015] "Fragments," as the term is used herein, includes molecules or 

molecular fragments (e.g., radicals) that can be used to model one or more 
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interaction with a macromolecule, such as the interactions of carbonyls, 
hydroxyls, amides, hydrocarbons, and the like. Examples of useful fragments 
include: 



Name 


Structure 


Acetone 


CH 3 (C=0)CH 3 


Aldehyde 


H(CO)-CH 3 


Amide 


H(C=0)NH 2 


Ammonia 


NH 3 


Benzene 








Carboxylic Acid 


CH 3 COOH 


1 ,4-Diazine 








Ester 


CH 3 -0-(C=0)-CH 3 


Ether 


CH 3 -0-CH 3 


Formaldehyde 


H 2 00 


Furan 




O 


Imidazole 


k> 


Methane 


CH 4 


Methanol 


CH3OH 


Phospho-Acid 


1 

HO 

C 


1 

> OH 

)H 
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Name 


Structure 


Pyridine 








Pyrimidine 


IN 






Pyrrole 




N 


Thiol 


CH 3 SH 


Thiophene 




S 



[0016] Preferably, the fragments selected are representative of chemical 

features that have proven useful in the design of pharmaceuticals or other 
bioactive chemicals. Additional fragments will be readily apparent to one 
skilled in the art. 

[0017] In one aspect of the methods of the present invention, a computer 

simulation of a polypeptide and at least one molecular fragment is conducted, 
wherein at least one interaction energy is calculated between the polypeptide 
and the at least one molecular fragment, wherein each calculated interaction 
energy is associated with a position of the molecular fragment. For example, 
in one aspect of the present invention, the calculation of interaction energy 
between the polypeptide and each molecular fragment, uses a Monte Carlo 
method to explore the protein/fragment confirmation space. 

[0018] In one embodiment, the three dimensional structure of a target protein, 

usually obtainable experimentally from x-ray crystallography, is known and 
the basic interactions between the protein and the small fragments (e.g., 
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average molecular weight of approximately 150) are computed. This 
computation can be carried out by Monte Carlo (MC) modeling and analysis 
(usually implemented in software) for a large collection of organic fragments 
with diverse physico-chemical properties. The number of fragments can be in 
the hundreds to thousands. For this purpose, Locus Pharmaceuticals, Inc., 
Blue Bell, PA, developed the Locus Monte Carlo (LMC) code. For each rigid 
fragment instance, a set of attributes are saved, including: 

(x,y,z), q = (q } , qi, q3, qd, fragment-protein energy, 

where (x,y,z) are the coordinates of the fragment's center of mass, and q is the 
quaternion characterizing its orientation. 

[0019] This MC data for the different fragments is analyzed for identifying 

potential binding sites using the methods of the present invention. These tools 
are based on the postulate that a binding site must be a high affinity region for 
a diverse collection of fragments. In one aspect, experimental binding site 
data (e.g., co-crystal X-ray data and residue mutational analysis), if available, 
is used to determine the final site within which the leads are designed. 

[0020] In another aspect, the actual thermodynamic fragment distributions 

around the protein, i.e., distributions consistent with thermal fluctuations at 
physiological temperatures are calculated. Information on the thermodynamic 
distribution is essential for computing free energies of binding, which is the 
basic biologically relevant quantity for quantifying the binding affinity of a 
ligand. The MCSS approach, by contrast, is essentially based on an energy 
minimization approach, providing fragment states corresponding to various 
local minima of the fragment-protein interaction potential energy field. Such a 
procedure is computationally more expeditious than computing the actual 
physical distributions, but is unable to provide information on entropic effects, 
essential for free energy estimates. 

[0021] For computing the thermodynamic distributions, MC simulation 

packages often make use of a Metropolis Monte Carlo approach (Metropolis, 
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N., et ah, J. Chem. Physics 27:1087-1092 (1953)) for sampling from a grand- 
canonical ensemble of states (Adams, D.J., Molecular Physics 29:307-311 
(1975); Mezei, M., Molecular Physics 57:565-582 (1987)). In addition to 
exchanging just energy with a surrounding thermal bath, as in the case of a 
canonical ensemble, the system described by a grand-canonical ensemble may 
exchange particles or fragments as well. The energy cost associated with 
inserting/deleting a fragment from the system is controlled by its chemical 
potential. By varying this chemical potential, so-called simulated annealing of 
the chemical potential, one may vary the average number of fragments in the 
simulation system. It can be shown that measuring the values of the chemical 
potential at which fragments leave various sites on the protein provides an 
estimate of the free energy of binding for the different binding modes over the 
protein surface. 

[0022] The practicality of the simulated annealing procedure for estimating 

binding affinities was demonstrated by Guarnieri and Mezei for differentiating 
hydration propensities of different DNA grooves (Guarnieri, F. and Mezei, M., 
J. Am. Chem. Soc. 775:8493-8494 (1996)). These results were obtained with 
the Metropolis Monte Carlo (MMC) code developed by the group of Mezei, 
Mount Sinai School of Medicine, NY. For these simulations, the system was 
composed of a molecule fraction of DNA surrounded by a varying number of 
interacting water molecules. In one embodiment, the LMC algorithm carries 
out a similar calculation for all fragments with respect to the target protein. 

[0023] Accordingly, in an embodiment of the present invention, a 

macromolecule is analyzed for potential binding sites. For example, this 
analysis can be accomplished by (1) positioning an instance of a computer 
representation of a molecule or molecular fragment at a plurality of sampling 
sites of the macromolecule; (2) selecting a value of B, wherein B = |a7kT + 
ln<N>, where \\! is the excess chemical potential, k is Boltzmann's constant, T 
is the absolute temperature, and <N> is the mean number of molecules of the 
molecule or molecular fragment; (3 Repositioning the instances of the 
molecule or molecular fragment; (4)accepting or rejecting each instance of the 
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repositioned molecule or molecular fragment based on the Metropolis 
sampling criteria using the computed binding energy compared to the selected 
value of B; (5) repeating steps (1) through (4) at a lesser value of B; and 
outputting a list of unrejected instances of the molecule or molecular 
fragment; wherein the molecule or molecular fragment is an organic fragment. 

[0024] In an embodiment, the sampling sites comprise an unbiased sampling 

of sites of the macromolecule. 

[0025] In another aspect of the invention, a Monte Carlo based computer 

simulation is conducted which excludes the fragment-fragment interactions. It 
has been acknowledged that considering fragment-fragment interactions may 
be detrimental to the interpretation of the simulation results for all fragments 
but water. Indeed, due to the high dilution of the solute molecules in actual 
biochemical relevant conditions, considering interactions between non-water 
fragments may not be realistic. Furthermore, the drug leads assembled by 
LCD usually are composed of only one fragment of each type. Fragment- 
fragment interactions in a simulation may thus lead to detrimental correlation 
effects. 

[0026] Therefore, in an embodiment, the interaction between a given fragment 

and a protein is analyzed by sampling the fragment states from a 
thermodynamically relevant Grand-Canonical distribution. The underlying 
sampling algorithm is a weighted Metropolis Monte Carlo approach, described 
herein as weighted Grand-Canonical Metropolis Monte Carlo (WGCMMC) 
sampling. The weighting procedure is implemented by subdividing space with 
an orthogonal, equidistant grid. Each grid cell x is assigned a local, numerical 
chemical potential field value B nU m,( x ) 5 which is adapted iteratively to ensure 
an approximately uniform numerical sampling of fragment states around the 
protein. B nU m is related to the thermodynamic cost of inserting or removing a 
fragment, and its local value defines the weight for each sampled fragment 
state. 

[0027] Once the B num field has sufficiently converged, and the Markov chain 

associated to the MC sampling has equilibrated, the Markov chain can be 
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sampled periodically at successive decorrelated states. Positions, orientations, 
potential energies and statistical weights for all fragment states are saved. 
Binding modes are then identified and corresponding binding free energies 
estimated. 

[0028] This approach makes use of a Grand-Canonical Metropolis Monte 

Carlo algorithm for sampling fragments around the target protein. This 
sampling data can then be directly used for estimating the free energy of 
binding for different binding modes of the fragment on the protein surface. 
This approach distinguishes itself from the MMC process, in that it removes 
fragment-fragment interactions. 

[0029] It turns out that the standard Monte Carlo approach has difficulty in 

handling simulations where fragment-fragment interactions are removed. 
Indeed, the absence of fragment-fragment interactions can lead to a broad 
range of fragment densities between the high and low affinity binding sites on 
the protein. This results from the possible overlap of fragments. The standard 
Metropolis Monte Carlo scheme used in conventional approaches has trouble 
resolving this dynamical range in densities. This problem is overcome with a 
weighted Monte Carlo scheme. 

[0030] Therefore, in an embodiment, a linear Monte Carlo approach (i.e., 

without fragment-fragment interactions) is used to calculate the protein- 
fragment interaction energies. This scheme can be described as "linear" in 
reference to the linear properties of the Liousville equation in the absence of 
fragment-fragment interactions. Liousville's equation describes the time 
evolution of the system away from equilibrium. 

[0031] First, the derivation of the grand canonical distribution for WGCMMC 

is presented below. 

[0032] The potential energy of the system composed of N fragments is 

denoted t/(77 N). In general, U includes both contributions from fragment- 
protein and fragment-fragment interactions. The configuration of the system 
is characterized by 

r=(x,K,-,Hr)l (1) 
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where Yi=(X[, Qj) stands for the position Xj and orientation Qj of fragment L 
[0033] In the grand canonical ensemble, the probability that the system has N 

fragments in configuration Y is given by 

/(r,A0= ^y^j^cxp[BN-^U(T t N)l (2) 
with the normalization factor given by the grand partition function 

Q= t ^exp(^)f-^exp[-p(/(r^)]. (3) 

Here V is the volume of the system, a is the volume of orientational space, and 
B is related to the excess chemical potential ja ex , i.e. the energy cost in units of 
P' 1 = KT for a particle to leave the system: 

B= p^ x + logAf, (4) 
where N is the average number of fragments in the system. 
[0034] Assuming no fragment- fragment interactions, the potential energy U of 

the system becomes: 

£/(r,A0=i>(4 (5) 

where E(Yj) is the energy of interaction of the ith fragment with the protein. 
[0035] The Grand Partition Function can then be written as 

Q= lLjn(eMB)j— exp[-p£(y)]J =expZ, (6) 

with 

Z = exp(5) J^exp[- ^E(Y)\ (7) 

[0036] The probability P(N) for having N fragments in the system is then 

given by 

UN) = \dY N f(Y,N) = exp(-Z)^. (8) 

As expected, this is simply the Poisson distribution with parameter Z. In 
particular, the average number of fragments in the system is given by 
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(N) = £ NP(N) = Z, (9) 

which thus scales exponentially with B. 
[0037] In fact, more generally, the probability P(n,AV) of finding n fragments 

in any given subvolume AV of configuration space is given by a Poisson 
distribution: 

--^»s%^-**4"iM^. (io) 

= z"/n\, 

with 

z= exp(i?)[^exp[-pi<(7)]. (11) 
[0038] Finally, the single fragment density is given by 

fjy)=t N\dY 2 ...\dY N f{Y = (y,Y 2> ...,Y n ),n) 

= exp(-2)^e X p[ S -p £ (y)]|^Z<»-" 

= ^-exp(5-p£(i')], (12) 

which again scales exponentially with respect to B. 

[0039] Equation (12) for the single fragment density shows the large 

dynamical range that may result from the exponential dependence of this 
quantity with respect to the single fragment-protein potential energy E(Y). 
This dependence results from the possible overlap of the non-interacting 
fragments. This was not an issue in the presence of fragment-fragment 
interactions, as an upper bound to the fragment density was then set by the 
tightest possible packing of the molecules. 

[0040] The underlying method developed for WGCMMC to enable the 

accurate resolution of the above-mentioned dynamical range in densities is 
presented here. 
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[0041] For numerical purposes, instead of considering a constant B value, one 

may consider a field B num (Y) in the single particle configuration space Y. This 
field represents the energy cost for a particle to leave the system specifically 
from position Y. In this case, the density of states in the grand canonical 
ensemble (2) is given by 



, . 1 1 1 

f™jy > N )- Q v n n Af t exp 



/=1 



with the normalization factor (grand partition function) now given by 

00 1 dY N N 



(14) 



(15) 



[0042] A similar derivation as the one used for obtaining Eq. (12) leads to the 

corresponding single fragment density: 

= -^-exp[* num (r) - VE(Y)]. (16) 

[0043] Equation (16) shows that through the field B num (Y), the amplitude of 

the density in each position Y of the single particle configuration space can be 
calculated. Thus, by iteratively adapting B num (Y) during the convergence 
phase of the Metropolis MC simulation, one may obtain appropriate sampling 
in all regions of interest. This is achieved by taking 

5 num (r)-min(p^(r)-f const,5 max ), (17) 
leading to similar numerical densities of fragment instances in various regions 
of space. An upper bound B max is set on B num to avoid unnecessary sampling 
in strongly unfavorable positions, i.e., essentially for configurations leading to 
steric clashes. In practice, the field B num (Y) is chosen to be independent of the 
fragment orientation, and to be piece-wise constant on a 3-D grid in x-space 
(translational-space). 

[0044] Making use of the exponential dependence in B of the density, one can 

infer the physical fragment density f %c (Y) at any B=B 0 = constant value from 
the simulation results for a given numerical B num (Y) field. Assume that one 
has a sampling ^T\=(Yi y ...,Y Ni ) },=i, n snap of n snap snapshots from the numerical 
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distribution y^ c ,num(r\N). The average of any single fragment quantity A(Y) 
over the distribution^^ is then given by 



U)= jdYfjY)A(Y)= j d Yf ecnu jY)-/^A(Y) 

J gc,num V 1 ) 

"snap 



(18) 



I "snap /V, 

snap /=1 7 = 1 

where wj is the weight assigned to the fragment state Y jf and defined by 

Wj = T^M = exp ( 5 ° ~ Bmm ^- (19) 

[0045] The scan over the B schedule carried out in an MMC-type annihilation 

simulation is thus replaced in a WGCMMC run by a simulation for a single 
B num (Y) field. 

[0046] The following addresses how the WGCMMC data can be handled and 

analyzed. 

[0047] The starting point for the data interpretation is the relation linking the 

WGCMMC data to the association constant K a characterizing the binding of 
the considered fragment to a given region on the protein. As a reminder, this 
relation is rederived here. 

[0048] The association constant K a characterizes the equilibrium of the 

binding process 

F + Po FP, (20) 

and is defined by 

[FP] 

*° = 1515]' (21) 

where [P], [F]> and [FP] are respectively the concentrations of protein alone, 
fragment alone, and of a particular protein-fragment complex (binding mode). 
The association constant is indeed the basic biologically relevant quantity. 
[0049] In the case of the LMC system, consider a single protein in a volume 

V. For the sake of the following discussion, take V to be large, although for 
the actual MC simulation this need not be the case. The protein concentration 
is therefore given by [P] = 1/V. Furthermore, note that n is the average 
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number of fragments in the binding volume AV b (in general a volume with 

limits both in translational and orientational space), and N is the average total 

number of fragments in the system, so that [F] = (N - n)/V and [FP] = n/V. 

The association constant can thus be written 

n/V n_ 
Ka ~ {N-n)/V\/V^ V N (22) 

having invoked the thermodynamic limit of large volume V, so that n«N 
(N/V II const, for V Yl <*>). The values n and N can be obtained from the 
fragment density (12): 

" = L b dY f*W = £ l Vb dYc^E{Y)\ (23) 

N= [dYf gc {Y)= ^-^7exp[-p£(F)]-e B , (24) 

having again invoked the assumption of the high protein dilution, so that the 
total system volume V is much larger than the effective region of interaction 
between the fragment and the protein and thus one may consider E(Y )z 0 in 
deriving Eq. (24). The association constant now becomes: 

K a = ^l v dY^p[-^E(Y)\ (25) 

[0050] On the basis of Eq. (25) one can also write the association constant in 

terms of the free energy of binding AA 

K a = Vexp{-$AA). (26) 

where AA = Aw — Af, with A F p and A F the free energies of the fragment- 
protein complex and of the fragment alone respectively: 

A FP = -|log( j AK ^7exp[-pE(r)]), (27) 

A F = -jlogldY=-j\og(Vo). (28) 

[0051] The critical value B c that is associated to the binding volume AV b can 

be defined as the value for which the average number of fragments in the 
binding site is 1. From Eq. (23) follows: 
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n(B c )=\ o e- B < = ^-l y dYexp[-VE(Y)\ (29) 

and from (25), (26) and (29) one finally obtains: 

K a = Ve' B % (30) 

* A = l B c- (31) 

Thus, a low Be value reflects a high affinity binding mode, and inversely a 
higher Be value reflects a lower affinity mode. 
[0052] The critical value B c can be computed from the WGCMMC data using 

definition (29), as well as Eqs (18) and (19): 



1 /?snap r / \1 

l =4 B c)=— Z Z expk-5 num r 7 

"snap t=l fragjeAV b J 

1 "snap , x 



o B c = -log 



"snap (=1 fragjeAV b 



(32) 



[0053] Equations (30), (31) and (32) provide the basic relations for how the 

WGCMMC data is to be interpreted. 

[0054] Following the computer simulation which calculates the interaction 

energy between a polypeptide and at least one molecular fragment, an affinity 
value is assigned to at least one fragment and residue pair when the fragment 
is in the vicinity of the residue. A fragment is defined as being in the vicinity 
of a residue when at least one pair of fragment-residue atoms (z, j) is within a 
predetermined threshold distance, wherein said threshold distance is based on 
the sum of the Van der Waals radii of said fragment-residue atoms. In an 
embodiment of the present invention, the predetermined threshold distance is 
defined as follows: 

rij<a(Rvdw,i + Rvdwj) (32) 
wherein ry is the distance between the two atoms, Rvdw is the Van der Waals 
radius and a is a numerical parameter. In an embodiment, a is between about 
0.5 and about 2.0, and preferably is about 1.2. In one aspect the Van der 
Waals radius is about half the Lennard-Jones parameter from a molecular 
mechanics force-field. In an aspect of the present invention, the molecular 
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mechanics force field is selected from the group consisting of AMBER, 
GROMOS, CHARMM, Xplor, Discover, MMFFF and Tripos. AMBER is a 
particularly preferred force field. 
, [0055] As discussed above, the affinity value that is assigned to any particular 

fragment-residue pair is a measure of the free energy of interaction between 
the polypeptide and fragment, thus, both enthalpic and entropic contributions 
are included. 

[0056] In an aspect of the present invention, when a Monte Carlo based 

computer simulation is used which includes fragment-fragment interactions, 
and a simulated annealing of chemical potential is conducted, the affinity 
value comprises B-critical. B critical is defined as the minimum B value for 
which a particular fragment is persistently observed in the vicinity of a 
residue, wherein B = jx'/kT + ln<N>, where is the excess chemical potential, 
k is the Boltzmann's constant, T is the absolute temperature, and <N> is the 
mean number of molecules of the molecular fragment. In an embodiment, a 
particular type of fragment is persistently observed in the vicinity of a residue 
when the average number of fragments in the vicinity of the residue is 
between 0.8 and 1.0. In another aspect of the present invention, a particular 
type of fragment is persistently observed in the vicinity of a residue when the 
average number of fragments in the vicinity is greater than or equal to 0.9. 

[0057] In another embodiment, with respect to linear Monte Carlo, B-critical 

is also used to determine the fragment/residue affinity. Accordingly, the 
binding affinity of a fragment for different regions on the protein surface can 
be estimated by assigning a critical B c to each fragment-residue pair. These B c 
values are obtained from the WGCMMC data by applying relation (32), where 
the volume AV b is approximated for each residue on the basis of a proximity 
criteria. 

[0058] The volume defined on the basis of the proximity criteria may only be 

an estimate of a binding mode volume. The corresponding B c values must 
therefore be used as estimates of free energy of binding. Nonetheless, 
comparing sets of B c values for different fragments has proven valuable to 
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help identify protein binding sites as follows: A binding site is identified as a 
set of residues with low B c values (high affinity) for multiple fragments with 
diverse physico-chemical properties. This approach is based on the assumption 
that diverse interactions in a localized region are the necessary condition for 
ensuring the specificity of a binding site. Preferably, this numerical 
localization of the binding site is complemented by experimental binding 
information such as co-crystal X-ray data and mutational analysis. 

[0059] Improved estimates for the binding mode volumes AV b , compared to 

the above described residue-based proximity criteria, provide more accurate 
estimates of free energy of binding using Eq. (32). Such improved binding 
mode volume estimates are determined and represented by clustering sampled 
fragment states belonging to the same potential energy well. For this purpose 
the potential energies saved for the sampled fragment states are used. 

[0060] In another aspect of the present invention, following the computer 

simulation of the polypeptide and at least one fragment and the assignment of 
affinity values to specific fragment/residue pairs, a binding analysis profile is 
outputted that comprises a matrix of affinity values for each fragment-residue 
pair. 

[0061] In an embodiment of the present invention, numerous separate 

computer simulations are conducted on a particular polypeptide, wherein in 
each simulation a different fragment type interacts with the protein. For 

N 

example, a simulation of polypeptide A is conducted with fragment X, 
wherein interaction energies are calculated, and affinity values assigned to 
fragment/residue pairs as described above. A computer simulation of 
polypeptide A is then conducted with fragment Y, wherein interaction 
energies are calculated, and affinity values assigned to fragment/residue pairs 
as described above, etc. 
[0062] When separate simulations are conducted for a given polypeptide, a 

separate affinity value matrix can be generated for each fragment type. In this 
way the output can enable a ranking of the residues with respect to average 
fragment-binding ability for a given residue. A matrix of affinity values can 
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be generated which is averaged over fragments types, and the polypeptide 
surface is coded according to average fragment binding affinity. For example, 
residues with highest fragment binding affinity values are a different color 
from the residues with the lowest affinity values. In this way, potential 
binding sites on the macromolecule can be identified. For example, the output 
can be visual, displaying the residues on the macromolecule's surface in 
different colors across the visible light spectrum from red (highest average 
residue-fragment affinity) to blue (lowest average residue-fragment affinity). 
In such an output, the potential binding sites with higher probabilities of being 
actual binding sites will appear as groups of residues colored red or closer to 
red in the spectrum. 

[0063] The residue-fragment affinity can also be used to measure the degree 

of convergence of a simulation. For example, a matrix of B-critical values for 
each residue-molecular fragment pair can be created (an "affinity profile"). 
Convergence is considered to have occurred when the affinity profile remains 
constant (or stops changing) within a predetermined threshold range. 

[0064] The residue-fragment affinity can also be used to measure the degree 

of difference between simulations. For example, the affinity profile of two or 
more simulations can be compared, and the absolute and statistical measures 
of their variance can be calculated. 

[0065] The residue-fragment affinity can also be used to identify key 

fragments which can be used to design ligands (i.e., drug candidates). For one 
or more selected residues, molecular fragments can be ranked according to 
affinity value. For example, for a selected residue, the molecular fragments 
can be listed in ascending or descending order of residue-fragment affinity. 
Similarly, in an embodiment, the invention enables the display of a table of 
residues for each fragment that highlights the regions on the protein for which 
the fragment has the highest affinity. 

[0066] The present invention is described in further detail in the following 

non-limiting examples. 
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EXAMPLES 



EXAMPLE 1 

[0067] The following data in Table 1 was generated from a simulation 

conducted according to the methods of the present invention on the protein 
Caspase-3. Amino acids are listed on the left hand side, while the fragments 
are listed at the top. The binding affinities associated with the fragment- 
residue pairs are listed. 

[0068] Table 1 - Fragment Binding Affinity for Caspase-3 
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acetone 


zene 


xylic acid 


dimethyl 
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ethanol 


zole 
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dine 
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hydro- 
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0 


0 


0 
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0 
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0 
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0 
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-27.307 


0 


0 
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0 
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0 
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0 


0 


0 
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0 


0 


0 


0 
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MET 
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0 


0 


0 
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0 
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0 


0 


0 
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0 


0 


0 


0 


0 
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0 


0 


0 


0 


0 


0 


0 


0 


0 


0 
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0 
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0 
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0 


0 
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0 


0 


0 
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0 


0 


0 


0 


0 


0 
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0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


MET 
A 61 


-31.098 


-14.530 


0 


-28.593 


-29.171 


-23.233 


-22.307 


0 


0 


-13.976 


-30.528 


0 
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0 
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0 
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0 


0 
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0 


0 
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0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


LEU 
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0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


ARG 
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0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


GLU 
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0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


THR 
A 77 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 
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0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


ARG 
A 79 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 
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0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 
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0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 
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0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 
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0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 
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A 84 


0 


0 


0 


-12.593 


0 


0 


0 


0 


0 


0 


0 


-18 
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0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 
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A 86 


0 


0 


0 


0 


0 


0 


0 


0 
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0 


0 


0 
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0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 
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0 


0 


-21.171 


0 
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0 


-12.472 


-12.976 
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0 


ASN 
A 89 


0 


0 


0 


0 
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0 


0 


0 


-12.472 


0 


0 


0 


ASP 
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0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 
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A 91 


-27.098 


-15.530 


0 


0 


-17.171 


0 
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0 


0 


-11.976 


-26.528 


0 
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A 92 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 
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0 


0 


0 


0 


0 


0 


0 


0 
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0 


0 


0 


GLU 
A94 


0 


0 


0 
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0 


0 


0 


0 


0 
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0 
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96 
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0 


0 


GLU 
A 173 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


NME 

A 999 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


ACE 
E0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


HID E 
185 


0 


-15.530 


0 


-19.593 


-18.171 


0 


-19.307 


0 


0 


0 


-18.528 


-16 


LYS 
E 186 


0 


0 


0 


-15.593 


-18.171 


-12.233 


0 


0 


0 


0 


0 


-16 


ILE E 
187 


-21 .098 


-15.530 


0 


0 


-20.171 


0 


-22.307 


0 


-13.472 


-11.976 


-23.528 


0 


PRO 
E 188 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


VAL 
E 189 


0 


0 


-5.808 


0 


0 


0 


0 


0 


0 


0 


0 


0 


ASP 
E 190 


0 


0 


-7.808 


0 


0 


-11.233 


0 


0 


0 


0 


0 


0 


ALA 
E 191 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


-19 


ASP 
E 192 


-21.098 


0 


0 


0 


0 


0 


-21 .307 


0 


-13.472 


0 


-22.528 


-19 


PHE 
E193 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


LEU 

E 194 


o 


o 


Q 


0 


0 


0 


0 


0 


0 


0 


0 


0 


TYR 
E 195 


0 


0 


-7.808 


0 


0 


-11.233 


0 


0 


0 


0 


0 


0 


ALA 
E 196 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


TYR 
E 197 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


SER 
E 198 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


THR 

E 199 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


ALA 

E 200 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


PRO 
E201 


0 , 


0 


0 


0 


0 


0 


0 


0 


-14.472 


0 


0 


0 


GLY 
E202 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 



207526_2.DOC 



24- 





acet- 
amide 


acetone 


ben- 
zene 


carbo- 
xylic acid 


dimethyl 
sulfoxide 


ethanol 


imida- 
zole 


iso- 
butane 


pyri mi- 
dine 


tetra- 
hydro- 
furan 


urea 


H 2 0 


^203 






0 


0 


0 


0 


0 


0 


0 


-9.976 


0 


0 


TYR 
E204 


0 


0 


0 


-12.593 


-29.171 


0 


0 


0 


-11.472 


-12.976 


0 


0 


SER 
E205 


-31 .098 


-30.530 


0 


-28.593 


-29.171 


-26.233 


-35.307 


-6.330 


-17.472 


-23.976 


-34.528 


-23 


TRP 

E206 


-26.098 


0 


0 


0 


0 


0 


0 


-6.330 


-11.472 


0 


-26.528 


0 


ARG 
E207 


-31 .098 


-30.530 


-5.808 


-28.593 


-29.171 


-26.233 


-35.307 


-6.330 


-17.472 


-23.976 


-34.528 


-15 


ASN 
E208 


0 


0 


0 


0 


0 


0 


0 


0 


0 


-9.976 


0 


0 


SER 
E209 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


LYS 
E210 


0 


0 


0 


-13.593 


0 


0 


0 


0 


0 


0 


0 


0 


ASP 
E211 


0 


0 


0 


-13.593 


0 


0 


0 


0 


0 


0 


0 


0 


GLY 
E212 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


SER 
E213 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


TRP 
E214 


0 


0 


0 


0 


0 


0 


0 


0 


0 


-9.976 


0 


0 


PHE 
E215 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


ILE E 

216 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


GLN 
E217 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


SER 
E218 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


LEU 
E219 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


CYS 
E220 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


ALA 
E221 


0 


0 


0 


0 


! 0 


0 


0 


0 


0 


0 


0 


0 


MET 

E222 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


LEU 
E223 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


LYS 
E224 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


GLN 
E225 


0 


0 


0 


0 


0 


0 


-29.307 


0 


-11.472 


0 


0 


-18 


TYR 
E226 


0 


0 


0 


0 


0 


0 


-29.307 


0 


-11.472 


0 


0 


-18 


ALA 
E 227 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


E228 


0 


0 


0 


0 


0 


0 


-29.307 


0 


-11.472 


0 


0 


-18 


LYS 
E229 


0 


0 


0 


0 


0 


0 


-29.307 


0 


-12.472 


0 


0 


-18 


LEU 
E230 


-18.098 


-1 3.530 


0 


-16.593 


-17.171 


-1 5.233 


-21 .307 


0 


-12.472 


0 


-20.528 


-16 


GLU 
E 231 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


PHE 
E232 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


MET 
E233 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


HIE E 

234 


0 


0 


0 


-1 6.593 


0 


0 


0 


0 


0 


0 


0 


-26 


ILE E 
235 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


LEU 
E236 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


THR 
E237 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


ARG 
E238 


-18.098 


-13.530 


o 


-16 593 




-15.233 


-22.307 


0 


-12.472 


-9.976 


-20.528 


-26 


VAL 
E239 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


ASN 
E240 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


ARG 
E241 


-22.098 


-16.530 


-5.808 


0 


-20.171 


0 


-24.307 


-4.330 


-12.472 


-15.976 


-19.528 


-23 


LYS 
E242 


-18.098 


0 


0 


0 


0 


0 


-24.307 


0 


-9.472 


0 


-19.528 


0 


VAL 
E243 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


ALA 
E244 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


THR 
E245 


0 


-15.530 


0 


-16.593 


-20.171 


0 


0 


-4.330 


-1 1 .472 


-15.976 


0 


0 


GLU | 
E246 


-22.098 


-16.530 


0 


0 


-19.171 


0 


-24.307 


0 


-12.472 


0 


-19.528 


0 



207526 2.DOC 



-25 - 





acet- 
amide 


acetone 


ben- 
zene 


carbo- 
xylic acid 


dimethyl 
sulfoxide 


ethanol 


imida- 
zole 


iso- 
butane 


pyrimi- 
dine 


tetra- 
hydro- 
furan 


urea 


H 2 0 


PHE 
E247 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


GLU 
E248 


0 


0 


0 


-16.593 


0 


0 


0 


0 


0 


-9.976 


0 


0 


SER 
E249 


0 


0 


0 


0 


0 


0 


0 


0 


0 


-9.976 


0 


0 


PHE 
E250 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


SER 
E251 


0 


0 


0 


O 


0 


0 


0 


0 


0 


0 


0 


0 


PHE 

E252 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


ASP 
E253 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


ALA 
E254 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


THR 

E255 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


PHE 

E256 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


HIE E 

257 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


ALA 

E258 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


LYS 
E259 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


LYS 
E260 


0 


0 


0 


-16.593 


0 


0 


0 


0 


0 


0 


0 


0 


GLN 
E 261 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


ILE E 

262 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


PRO 
E263 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


CYS 
E264 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


ILE E 
265 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


VAL 
E266 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


SER 
E 267 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


MET 
E268 


0 


0 


-7.808 


0 


0 


-1 1 .233 


0 


0 


0 


0 


0 


0 


LEU 
E 269 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


THR 
E270 


-21.098 


0 


-6.808 


0 


0 


0 


-21.307 


0 


-13.472 


0 


-22.528 


0 


LYS 
E271 


-21.098 


-15.530 


-6.808 


0 


-20.171 


-12.233 


-22.307 


0 


-13.472 


-1 1 .976 


-22.528 


0 


GLU 
E272 


-25.098 


-18.530 


0 


-19.593 


' -23.171 


-22.233 


-27.307 


0 


-13.472 


-12.976 


-22.528 


-21 


LEU 
E 273 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


TYR 
E274 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


PHE 

E275 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


TYR 
E276 


0 


0 


0 


0 


0 


0 


0 


-4.330 


0 


0 


0 


0 


HIE E 

277 


0 


0 


0 


0 


0 


0 


0 


-4.330 


0 


0 


0 


0 


NME 

E999 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


ACE 
BO 


-18.098 


-13.530 


0 


-16.593 


-17.171 


-15.233 


-21.307 


0 


-12.472 


0 


-20.528 


-16 


ASN 
B35 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


-23 


SER 
B36 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


B37 


-23.098 


0 


0 


0 


0 


0 


-27.307 


0 


0 


0 


0 


0 


LYS 
B38 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


MET 
B39 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


ASP 
B40 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


TYR 
B41 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


PRO 
B42 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


GLU 
B43 


0 


0 


0 


0 


0 


0 


-22.307 


0 


0 


0 


0 


-17 


MET 
B44 


0 


0 


0 


0 


0 


0 


0 


0 


. 0 


0 


0 


0 


GLY 
B45 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


-17 | 



207526_2.DOC 



-26- 





acet- 
amide 


acetone 


ben- 
zene 


carbo- 
xyiic acid 


dimethyl 
sulfoxide 


ethanol 


imida- 
zole 


iso- 
butane 


pyri mi- 
dine 


tetra- 
hydro- 
furan 


urea 




LEU 
B46 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


CYS 
B47 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


ILE B 
48 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


ILE B 
49 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


ILE B 
50 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


ASN 
B51 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


ASN 
B52 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


LYS 
B53 


-22.098 


0 


0 


-15.593 


-20.171 


-14.233 


0 


0 


0 


-11.976 


-17.528 


0 


ASN 
B54 


0 


0 


0 


0 


0 


-1 1 .233 


0 


0 


0 


0 


0 


0 


PHE 
B55 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


HIE B 

56 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


LYS 
B57 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


SER 
B58 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


THR 
B59 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


GLY 
B60 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


MET 
B61 


-27.098 


0 


0 


0 


-29.171 


-22.233 


-31.307 


-4.330 


-16.472 


-19.976 


0 


0 


THR 
B62 


0 


0 


0 


0 


0 


0 


0 


0 


-12.472 


0 


0 


0 


SER 
B63 


0 


0 


0 


0 


0 


-1 1 .233 


0 


0 


0 


0 


0 


0 


ARG 
B64 


-27.098 


-16.530 


0 


-21.593 


-29.171 


-18.233 


-35.307 


-4.330 


-29.472 


-19.976 


-34.528 


-21 


SER 
B65 


-22.098 


0 


0 


-15.593 


-20.171 


-14.233 


0 


0 


0 


-1 1 .976 


-17.528 


0 


GLY 
B66 


-22.098 


0 


0 


-15.593 


-20.171 


-14.233 


0 


0 


0 


-11.976 


-17.528 


0 


THR 
B67 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


ASP 
B68 


-22.098 


0 


0 


-15.593 


-20.171 


-14.233 


0 


0 


0 


-11.976 


-17.528 


0 


VAL 
B69 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


ASP 
B70 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


ALA 
B71 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


ALA 
B72 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


ASN 
B73 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


LEU 
B74 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


ARG 
B75 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


GLU 
B76 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


THR 
B77 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


PHE 
B78 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


ARG 
B79 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


ASN 
B 80 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


LEU 
B81 






0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


LYS 
B 82 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


TYR 
B83 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


GLU 
B 84 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


-17 


VAL 
B85 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


ARG 
B86 


0 


-15.530 


0 


0 


-21.171 


0 


0 


0 


0 


-9.976 


0 


0 


ASN | 
B87 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


LYS 
B88 


0 


-1 5.530 


0 


0 


-21.171 


0 


-19.307 


0 


0 


-9.976 


0 


0 


ASN 
B89 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 



207526_2.DOC 



-27- 





acet- 
arnide 


acetone 


ben- 
zene 


carbo- 
xylic acid 


dimethyl 
sulfoxide 


ethanol 


imida- 
zole 


iso- 
butane 


pyrimi- 
dine 


tetra- 
hydro- 
furan 


urea 


H 2 0 


ASP 
B90 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


LEU 
B91 


0 


0 


0 


0 


0 


0 


-19.307 


0 


0 


0 


0 


0 


THR 

B92 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


ARG 
B93 




0 


0 


0 


-20.171 


0 


-29.307 


0 


0 


0 


-18.528 


-17 


GLU 
B94 


Ol AAA 


-19.530 


-5.808 


-13.593 


-25.171 


0 


-29.307 


0 


-9.472 


-13.976 


-23.528 


-16 


GLU 
B95 


0 


0 


0 


0 


0 


0 


-19.307 


0 


0 


0 


0 


0 


ILE B 

96 


0 


0 


0 


0 


0 


0 


! 0 


0 


0 


0 


0 


0 


VAL 
B 97 


-23.098 


-19.530 


0 


-13.593 


-25.171 


0 


-30.307 


0 


-16.472 


-13.976 


-19.528 


-16 


GLU 
B98 


-23.098 


-19.530 


0 


-15.593 


-25.171 


0 


-30.307 


0 


-16.472 


0 


-19.528 


0 


LEU 
B99 


0 


-13.530 


0 


0 


-20.171 


0 


0 


0 


0 


-9.976 


0 


0 


MET 
B 100 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


ARG 
B 101 


-22.098 


0 


0 


-15.593 


0 


0 


-30.307 


0 


-16.472 


0 


-19.528 


0 


ASP 
B 102 


-22.098 


0 


0 


-15.593 


0 


0 


0 


0 


0 


0 


-18.528 


0 


VAL 
B 103 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


SER 
B 104 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


LYS 
B 105 




0 


0 


-15.593 


0 


0 


0 


0 


O 


0 


-18.528 


0 


GLU 
B 106 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


ASP 
B 107 


0 


0 


0 


0 


0 


0 


-21.307 


0 


0 


0 


0 


0 


HJE B 
108 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


SER 
B 109 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


LYS 
B 110 


0 


0 


0 


0 


0 


0 


-22.307 


0 


0 


0 


0 


0 


ARG 
B 111 


0 


0 


0 


0 


0 


0 


-21.307 


0 


0 


0 


0 


-17 


SER 
B 112 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


SER 
B 113 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


PHE 
B 114 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


VAL 
B 115 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


CYS 
B 116 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


VAL 
B117 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


LEU 
B 118 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


LEU 
B 119 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


SER 
B 120 


-26.098 


-16.530 


0 


-21 .593 


-27.171 


-26.233 


-35.307 


-4.330 


-29.472 


-19.976 


-30.528 


-21 


HIP B 
121 


-31 .098 


-25.530 


0 


-27.593 


-29.171 


-26.233 


-35.307 


-6.330 


-29.472 


-19.976 


-35.528 


-23 


GLY 
B 122 


-31.098 


-25.530 


0 


-17.593 


-29.171 


-22.233 


-31 .307 


-4.330 


-16.472 


-19.976 


-35.528 


-21 


GLU 
B 123 


0 


-25.530 


0 


0 


-29.171 


0 


0 


0 


0 


0 


0 


0 


GLU 
B 124 


0 


0 


0 


0 


0 


0 


0 


0 


-13.472 


0 


0 


0 


GLY 
B 125 


o 


o 


o 






0 


0 


0 


0 


0 


0 


0 


ILE B 
126 


0 


0 ! 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


ILE B 
127 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


PHE 
B 128 


0 


-16.530 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


GLY 
B 129 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


THR 
B 130 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


ASN 
B 131 


0 


0 


0 


0 | 


0 


0 


0 


0 


0 


0 


0 


0 


GLY 
B 132 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


PRO 
B133 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 



207526_2.DOC 



-28- 





acet- 
amide 


acetone 


ben- 
zene 


carbo- 
xylic acid 


dimethyl 
sulfoxide 


ethanol 


imida- 
zole 


iso- 
butane 


pyri mi- 
dine 


tetra- 
hydro- 
furan 


urea 


H 2 0 


VAL 
B 134 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


ASP 
B 135 


0 


0 


0 


0 


-20.171 


0 


0 


0 


0 


0 


0 


0 


LEU 

B 136 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


LYS 
B 137 


-18.098 


0 


0 


0 


0 


0 


-24.307 


0 


-9.472 


0 


-20.528 


0 


LYS 
B 138 


-24.098 


-19.530 


-5.808 


0 


-25.171 


0 


-30.307 


0 


-16.472 


-13.976 


-23.528 


-16 


ILE B 

139 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


THR 
B 140 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


ASN 
B 141 


-18.098 


0 


0 


0 


0 


0 


-24.307 


0 


-10.472 


0 


-20.528 


0 


PHE 
B 142 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


PHE 
B 143 


0 


0 


0 


0 


0 


0 


O 


0 


0 


0 


0 


0 


ARG 
B 144 


-26.098 


-16.530 


0 


0 


-25.171 


0 


-30.307 


0 


-10.472 


-10.976 


-25.528 


0 


GLY 
B 145 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


ASP 
B 146 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


ARG 
B 147 


0 


0 


0 


0 


0 


0 


O 


0 


0 


0 


0 


0 


CYS 
B 148 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


ARG 
B 149 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


SER 
B 150 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


LEU 
B 151 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


THR 
B 152 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


-20 


GLY 
B 153 


-23.098 


-15.530 


-7.808 


0 


-20.171 


0 


-27.307 


-4.330 


-1 1 .472 


-15.976 


0 


-20 


LYS 
B 154 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


PRO 
B 155 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


LYS 
B 156 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


-20 


LEU 
B 157 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


PHE 
B 158 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


ILE B 
159 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


ILE B 
160 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


GLN 
B 161 


-27.098 


-16.530 


0 


0 


-29.171 


-18.233 


-19.307 


-4.330 


-17.472 


-19.976 


-26.528 


0 


ALA 
B 162 


-31.098 


-25.530 


0 


-21.593 


-29.171 


-26.233 


-35.307 


-4.330 


-17.472 


-19.976 


-34.528 


-21 


CYM 
B 163 


-31 .098 


-25.530 


0 


-27.593 


-29.171 


-26.233 


-35.307 


-6.330 


-29.472 


-19.976 


-34.528 


-23 


ARG 
B 164 


0 


0 


0 


0 


0 


0 


0 


0 


-14.472 


0 


0 


0 


GLY 
B 165 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


THR 
B 166 


0 


0 


0 


0 j 


0 


0 


0 


0 


0 


0 


0 


0 


GLU 
B 167 


-24.098 


-13.530 


0 


-13.593 


-16.171 


-15.233 


-25.307 


0 


0 


-9.976 


-23.528 


-21 


LEU 
B 168 


-24.098 


0 


-5.808 


0 


-16.171 


-15.233 


-21.307 


0 


0 


-9.976 


0 


0 


ASP 
B 169 


o 


o 


Q 


0 


0 


0 


0 


0 


0 


0 


0 


0 


CYS 
B 170 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


GLY 
B 171 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


ILE B 
172 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


GLU 
B 173 


0 


0 


0 


0 


0 


0 


-19.307 


0 


0 


0 


-22.528 


0 


NME 
B999 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


ACE 
FO 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


HID F 
185 


0 


-14.530 


-7.808 


0 


-20.171 


0 


-27.307 


-4.330 


-11.472 


-15.976 


0 


0 


LYS 
F 186 


o : 


0 


0 


-16.593 


0 


0 


0 


0 


0 


0 


0 


0 



207526_2.DOC 



29- 





acet- 
amide 


acetone 


ben- 
zene 


carbo- 
xytic acid 


dimethyl 
sulfoxide 


ethanol 


imida- 
zole 


iso- 
butane 


pyri mi- 
dine 


tetra- 
hydro- 
furan 


urea 


H z O 


ILE 
F 187 


-23.098 


-14.530 


0 


0 


-20.171 


0 


-27.307 


-4.330 


-11.472 


-15.976 


0 


-20 


PRO 
F 188 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


VAL 
F 189 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


ASP 
F 190 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


ALA 
F 191 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


-20 


ASP 
F 192 


-23.098 


-14.530 


0 


0 


-20.171 


0 


-27.307 


0 


-11.472 


-15.976 


0 


-20 


PHE 

F 193 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


LEU 
F 194 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


TYR 
F 195 


0 


; o 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


ALA 
F 196 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


TYR 
F 197 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


SER 
F 198 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


THR 
F 199 


0 


0 


0 


0 


0 


0 


0 


0 


0 


! o 


0 


0 


ALA 
F200 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


PRO 
F201 


0 


0 


-7.808 


-13.593 


0 


-11.233 


0 


-4.330 


-12.472 


0 


0 


-21 


GLY 
F202 


0 


0 


0 


-13.593 


0 


0 


0 


0 


0 


0 


0 


0 


TYR 
F203 


-24.098 


0 


-7.808 


0 


0 


-15.233 


0 


0 


0 


0 


-18.528 


-21 


TYR 
F204 


0 


0 


0 


0 


-29.171 


-11.233 


0 


-4.330 


-17.472 


-15.976 


0 


0 


SER 
F205 


-31.098 


-25.530 


0 


-21 .593 


-29.171 


-26.233 


-35.307 


-4.330 


-29.472 


-19.976 


-34.528 


-21 


TRP 
F206 


0 


0 


0 


0 


0 


0 


0 


-4.330 


-17.472 


0 


0 


0 


ARG 
F207 


-31 .098 


-25.530 


-5.808 


-21.593 


-29.171 


-26.233 


-35.307 


-4.330 


-29.472 


-19.976 


-34.528 


0 


ASN 
F208 


0 


0 


0 


0 


0 


0 


0 


0 


-10.472 


0 


0 


0 


SER 
F209 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


LYS 
F210 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


ASP 
F211 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


GLY 
F212 


0 


0 


0 


0 


0 


0 


0 


0 


0 : 


0 


0 


0 


SER 
F213 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


TRP 
F214 


0 


0 


0 


0 


0 


0 


0 


0 


-10.472 


0 


0 


0 


PHE 

F215 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


ILE F 

216 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


GLN 
F217 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


SER 
F218 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


LEU 
F219 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


CYS 
F220 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


ALA 
F221 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


MET 
F222 


o 






0 


0 


0 


0 


0 


-13.472 


-13.976 


0 


0 


LEU 
F223 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


LYS 
F224 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


GLN 
F225 


0 


0 


0 


0 


-16.171 


0 


0 


0 


0 


0 


0 


0 


TYR 
F226 


0 


0 


0 


0 


0 


o ; 


0 


0 


0 


0 


0 


0 


ALA 
F227 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


ASP 
F228 


0 


0 


0 


0 


-16.171 


0 


0 


0 


0 


0 


-17.528 


o I 


LYS 
F229 


0 


0 


0 


0 


-16.171 


0 


0 


0 


0 


0 


-17.528 


0 


LEU I 
F230 


-25.098 


-18.530 


0 


-19.593 


-19.171 


0 


0 


0 


-12.472 


0 


-22.528 


0 



207526_2.DOC 



30- 





acet- 
amide 


acetone 


ben- 
zene 


carbo- 
xylic acid 


dimethyl 
sulfoxide 


ethanol 


imida- 
zole 


iso- 
butane 


pyri mi- 
dine 


tetra- 
hydro- 
furan 


urea 


HzO 


GLU 

F231 


0 


0 


0 


0 


0 


! 0 


0 


0 


0 


0 


0 


0 


PHE 

F232 


0 


0 


0 


0 


0 


! 0 


0 


0 


0 


0 


0 


0 


MET 
F233 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


HIE F 
234 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


ILE F 

235 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


LEU 

F236 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


THR 

F237 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


ARG 
F238 


-25.098 


-1 8.530 


0 


-19.593 


-23.171 


-22.233 


-27.307 


0 


-13.472 


-12.976 


-22.528 


-21 


VAL 
F 239 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


ASN 
F240 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


ARG 
F241 


-24.098 


-15.530 


-8.808 


0 


-20.171 


0 


-26.307 


0 


-13.472 


-13.976 


-23.528 


0 


LYS 
F242 


-24.098 


-13.530 


-5.808 


0 


0 


0 


-26.307 


0 


-13.472 


-13.976 


-20.528 


0 


VAL 
F243 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


ALA 
F244 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


THR 
F245 


-24.098 


-15.530 


-6.808 


-19.593 


-18.171 


0 


-19.307 


0 


-13.472 


-9.976 


-23.528 


0 


GLU 
F246 


-24.098 


-15.530 


0 


0 


-20.171 


0 


-26.307 


0 


-13.472 


-13.976 


-23.528 


0 


PHE 
F247 


0 


-15.530 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


GLU 
F248 


0 


-15.530 


0 


-19.593 


-18.171 


-12.233 


0 


0 


-10.472 


0 


-18.528 


-16 


SER 
F249 


0 


0 


0 


0 


0 


0 


0 


0 


-10.472 


0 


0 


0 


PHE 
F250 


0 


0 


0 


0 


0 


0 


0 


0 


-10.472 


0 


0 


0 


SER 
F251 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


PHE 
F252 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


ASP 
F253 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


ALA 
F254 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


THR 
F255 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


PHE 
F256 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


HIE F 
257 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


ALA 
F258 


0 


0 


0 


-12.593 


0 


-12.233 


0 


0 


0 


0 


0 


0 


LYS 
F259 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


LYS 
F260 


0 


-15.530 


0 


-1 9.593 


-18.171 


0 


0 


0 


0 


0 


-18.528 


-16 


GLN 
F261 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


ILE F 
262 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


PRO 
F263 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


CYS 
F264 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


ILE F 

265 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


VAL 
F266 


o 


Q 


o 


0 


0 


0 


0 


0 


0 


0 


0 


0 


SER 
F267 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


MET 
F 268 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


LEU 

F 269 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


THR 
F270 


-23.098 


-14.530 


o 


0 


-20.171 


0 


-27.307 


-4.330 


-11.472 


-15.976 


0 


0 


LYS 
F 271 


-23.098 


-15.530 


-7.808 


0 


-20.171 


0 


-27.307 


-4.330 


-11.472 


-15.976 


0 


0 I 


GLU 
F272 


-18.098 


-13.530 


0 


-16.593 


-17.171 


-15.233 


-21.307 


0 


0 


0 


-20.528 


-26 


LEU 

F 273 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


TYR 
F274 


0 


-13.530 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 



207526_2.DOC 



-31 - 





acet- 
amide 


acetone 


ben- 
zene 


carbo- 
xylic acid 


dimethyl 
sulfoxide 


ethanol 


imida- 
zole 


iso- 
butane 


pyrimi- 
dine 


tetra- 
hydro- 
furan 


urea 


H 2 0 


PHE 

F275 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


TYR 
F276 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


HIDF 

277 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


NME 
F999 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 



EXAMPLE 2 

[0069] The following data in Table 2 was generated from a simulation 

conducted according to the methods of the present invention on the protein 
Caspase-8. Amino acids are listed on the left hand side, while the fragments 
are listed at the top. The binding affinities associated with the fragment- 
residue pairs are listed. 

[0070] Table 2 - Fragment Binding Affinity for Caspase-8 





acet- 
amide 


acetone 


ben- 
zene 


carbo- 
xylic 
acid 


dimethyl 
sulfoxide 


ethanol 


imidazol 
e 


iso- 
butane 


pyrimi- 
dtne 


tetra- 
hydro- 
furan 


urea 


H z O 


ACE 
AO 


-27.098 


-8.530 


0 


0 


-11.171 


0 


-14.307 


0 


0 


-6.976 


-22.528 


-17 


ASP 

A 223 


-27.098 


-20.530 


-3.808 


-12.593 


-22.171 


-13.233 


-30.307 


0 


-14.472 


-1 1 .976 


-22.528 


-17 


LYS 
A 224 


-27.098 


-20.530 


-3.808 


-12.593 


-22.171 


-14.233 


-30.307 


0 


-14.472 


-1 1 .976 


-22.528 


-16 


VAL 
A 225 


-13.098 


-11.530 


-4.808 


-12.593 


-15.171 


-10.233 


0 


-2.330 


-9.472 


-9.976 


-19.528 


0 


TYR 

A 226 


-26.098 


-21.530 


-4.808 


-10.593 


-26.171 


-16.233 


-20.307 


-2.330 


-1 1 .472 


-11.976 


-20.528 


-11 


GLN 
A 227 


-26.098 


-21.530 


-4.808 


-10.593 


-26.171 


-1 1 .233 


-20.307 


-2.330 


-1 1 .472 


-1 1 .976 


-20.528 


-11 


MET 

A 228 


-13.098 


-12.530 


0 


-10.593 


-18.171 


0 


-13.307 


0 


-9.472 


-8.976 


-22.528 


0 


LYS 
A 229 


-22.098 


-12.530 


-4.808 


-13.593 


-18.171 


-15.233 


-25.307 


-2.330 


-1 1 .472 


-7.976 


-30.528 


0 


SER 
A 230 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


LYS 
A 231 


0 


0 


0 


-13.593 


0 


-15.233 


0 


0 


-7.472 


0 


0 


-12 


PRO 

A 232 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


ARG 
A 233 


0 


0 


0 


-13.593 


0 


-15.233 


0 


0 


0 


0 


0 


-12 


GLY 
A 234 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


TYR 
A 235 


0 


0 


0 


-8.593 


0 


0 


0 


0 


0 


0 


0 


0 


CYS 
A 236 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


LEU 

A 237 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


ILEA 
238 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


ILEA 
239 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


ASN 
A 240 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


ASN 
A 241 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


HID A 
242 


0 


-11.530 


0 


0 


0 


0 


-14.307 


0 


0 


0 


0 


0 


ASN 
A 243 


0 


0 


0 


0 


0 


-9.233 


0 


0 


0 


0 


0 


0 


PHE 
A 244 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 



207526_2.DOC 



-32- 





acet- 
amide 


acetone 


ben- 
zene 


carbo- 
xylic 
acid 


dimethyl 
sulfoxide 


ethanol 


imidazol 
e 


iso- 
butane 


pyrimi- 
dine 


tetra- 
hydro- 
furan 


urea 


H 2 0 


ALA 
A 245 


0 


0 


-3.808 


0 


0 


0 i 


0 


0 


0 


0 


0 


0 


LYS 
A 246 


-15.098 


-1 1 .530 


-4.808 


-8.593 


-15.171 


-9.233 


-17.307 


0 


-15.472 


-6.976 


-12.528 


-11 


ALA 
A 247 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


ARG 
A 248 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


GLU 
A 249 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


LYS 
A 250 


-28.098 


-12.530 


-3.808 


-15.593 


-11.171 


-13.233 


-23.307 


0 


-15.472 


-8.976 


-32.528 


-14 


VAL 
A 251 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


-1 1 


PRO 
A 252 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


LYS 
A 253 


-14.098 


-8.530 


0 


-1 1 .593 


-10.171 


-12.233 


-14.307 


-2.330 


-9.472 


0 


-17.528 


-14 


LEU 
A 254 


0 


0 


0 


0 


0 


0 


0 


-2.330 


0 


0 


0 


0 


HID A 

255 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


SER 
A 256 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


1LE A 
257 


0 


0 


-3.808 


0 


0 


0 


0 


-3.330 


0 


0 


0 


0 


ARG 
A 258 


0 


0 


-8.808 


-7.593 


0 


-12.233 


-15.307 


-2.330 


-11 .472 


-6.976 


0 


0 


ASP 
A 259 


-14.098 


0 


-8.808 


-7.593 


0 


-12.233 


-15.307 


0 


-11 .472 


0 


-12.528 


0 


ARG 
A 260 


-30.098 


-25.530 


-8.808 


-22.593 


-28.171 


-18.233 


-32.307 


0 


0 


-16.976 


-31 .528 


-10 


ASN 
A 261 


-14.098 


-1 1.530 


0 


-7.593 


0 


-12.233 


-15.307 


0 


-11 .472 


0 


-12.528 


0 


GLY 
A 262 


0 


-1 1 .530 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


THR 
A 263 


0 


-1 1 .530 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


HIE A 
264 


0 


-1 1 .530 


0 


0 


0 


0 


-14.307 


0 


0 


0 


0 


0 


LEU 
A 265 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


ASP 
A 266 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


ALA 
A 267 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


GLY 
A 268 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


ALA 
A 269 


0 


0 


0 


0 


0 


0 


0 


-2.330 


0 


0 


0 


0 


LEU 
A 270 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


THR 
A 271 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


THR 

A 272 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


THR 
A 273 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


PHE 
A 274 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


GLU 
A 275 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


GLU 
A 276 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


LEU 
A 277 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


HIE A 
278 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


PHE 
A 279 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


GLU 

A 280 


0 


0 


0 


-9.593 


0 


0 


0 


0 


0 


0 


0 


0 


ILEA 
281 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


LYS 
A 282 


0 


0 


0 


-9.593 


0 


0 


0 


0 


0 


0 


0 


0 


PRO 
A 283 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


HIDA 
284 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


ASP 
A 285 


0 


0 


0 


0 


0 


0 


-14.307 


-2.330 


0 


0 


0 


0 


ASP 
A 286 


-13.098 


-1 1 .530 


-3.808 


-8.593 


-14.171 


-9.233 


-16.307 


0 


-7.472 


0 


-12.528 


-10 


CYS 
A 287 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


THR 
A 288 


-15.098 


-11.530 


0 


-15.593 


-12.171 


0 


-13.307 


o 


-15.472 


0 


0 


0 



207526_2.DOC 



-33- 





acet- 
amide 


acetone 


ben- 
zene 


carbo- 
xylic 
acid 


dimethyl 
sulfoxide 


ethanol 


imidazol 
e 


iso- 
butane 


pyrimi- 
dine 


tetra- 
hydro- 
furan 


urea 


H 2 0 


VAL 
A 289 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


GLU 
A 290 


-28.098 


-12.530 


0 


-15.593 


-1 1.171 


-13.233 


-23.307 


0 


0 


0 


-32.528 


-10 


GLN 
A 291 


-15.098 


-11.530 


0 


-8.593 


-14.171 


-9.233 


-18.307 


0 


0 


-6.976 


-12.528 


0 


ILE A 

292 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


TYR 
A 293 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 ! 


0 


GLU 

A 294 


-15.098 


-9.530 


0 


0 


-13.171 


0 


-21 .307 


0 


0 


-6.976 


-15.528 


0 


ILEA 

295 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


LEU 

A 296 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


LYS 
A 297 


-15.098 


-9.530 


-3.808 


0 


-13.171 


0 


-21.307 


0 


0 


-6.976 


-15.528 


0 


ILEA 
298 


0 


0 


0 


0 


0 


0 


-13.307 


0 


0 


-6.976 


-15.528 


0 


TYR 
A 299 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


GLN 

A 300 


-13.098 


-1 1 .530 


0 


-11.593 


-12.171 


0 


-13.307 


0 


-8.472 


-8.976 


0 


0 


LEU 
A 301 


-13.098 


-11.530 


0 


-11.593 


-12.171 


0 


-13.307 


0 


-8.472 


-8.976 


0 


0 


MET 

A 302 


-13.098 


-11.530 


0 


-1 1 .593 


-12.171 


0 


-13.307 


0 


-8.472 


-6.976 


0 


0 


ASP 
A 303 


-13.098 


-1 1 .530 


0 


0 


-12.171 


0 


0 


0 


0 


0 


0 


0 


HIE A 

304 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


SER 
A 305 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


ASN 
A 306 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


MET 

A 307 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


ASP 
A 308 


0 


0 


0 


0 


0 


0 


-20.307 


0 


-7.472 


0 


-17.528 


-11 


CYS 
A 309 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


PHE 
A 310 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


ILEA 
311 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


CYS 
A 312 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


CYS 
A 313 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


ILEA 
314 


0 


0 


0 


0 


0 


0 


0 


o 


0 


0 


0 


0 


LEU 
A 315 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


SER 
A 316 


-27.098 


-25.530 


-8.808 


0 


0 


-18.233 


-25.307 


0 


0 


-16.976 


0 


0 


HIE A 
317 


-26.098 


-25.530 


-8.808 


-22.593 


-28.171 


-18.233 


-32.307 


-3.330 


0 


-16.976 


0 


0 


GLY 
A 318 


0 


0 


0 


0 


0 


0 


0 


-2.330 


0 


0 


0 


0 


ASP 

A 319 


-14.098 


-10.530 


-3.808 


-1 1 .593 


-15.171 


-1 1 .233 


-20.307 


0 


-9.472 


-7.976 


-17.528 


-14 


LYS 

A 320 


-23.098 


-15.530 


-3.808 


-8.593 


-18.171 


-9.233 


-26.307 


0 


-10.472 


-7.976 


-24.528 


0 


GLY 
A 321 


0 


0 


0 


0 


0 


0 


0 


-2.330 


0 


0 


0 


0 


ILEA 
322 


0 


0 


0 


-12.593 


0 


0 


0 


0 


0 


0 


0 


0 


ILEA 

323 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


TYR 
A 324 


0 


0 


0 


0 


0 


0 


0 


-2.330 


0 


0 


0 


0 


GLY 
A 325 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


THR 
A 326 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


ASP 
A 327 


-28.098 


-12.530 


0 


-15.593 


-11.171 


-13.233 


-23.307 


0 


-15.472 


-8.976 


-32.528 


-14 


GLY 
A 328 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


GLN 
A 329 


0 


0 


0 


-7.593 


-10.171 


0 


0 


0 


0 


0 


0 


-11 


GLU 

A 330 


0 


-8.530 


0 


-1 1 .593 


-10.171 


-12.233 


-14.307 


0 


-9.472 


0 


-17.528 


-11 


ALA 
A 331 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


PRO 
A 332 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 



207526_2.DOC 



-34- 





acet- 
amide 


acetone 


ben- 
zene 


carbo 
xylic 
acid 


dimethyl 
sulfoxide 


ethanol 


imidazol 
e 


iso- 
butane 


pyri mi- 
dine 


tetra- 
hydro- 
furan 


urea 


H z O 


ILE A 

333 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


TYR 
A 334 


0 


0 


0 


0 


0 


0 


0 


-2.330 


0 


0 


0 


0 


GLU 
A 335 


-14.098 


0 


0 


0 


0 


-7.233 


-17.307 


0 


0 


0 


-16.528 


0 


LEU 

A 336 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


THR 
A 337 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


SER 
A 338 


-14.098 


0 


0 


0 


0 


-7.233 


0 


0 


0 


0 


0 


0 


GLN 
A 339 


-14.098 


0 


0 


0 


0 


0 


-17.307 


0 


0 


0 


-16.528 


0 


PHE 

A 340 


0 


0 


0 


0 


0 


0 - 


0 


0 


0 


0 


0 


0 


THR 
A 341 


0 


0 


0 


0 


0 


-8.233 


0 I 


0 


0 


0 


0 


0 


GLY 
A 342 


-16.098 


0 


0 


0 


0 


-7.233 


0 


-2.330 


0 


0 


-13.528 


0 


LEU 

A 343 


0 


0 


0 


0 


0 


0 


0 


-2.330 


0 


0 


0 


0 


LYS 
A 344 


-14.098 


-8.530 


-3.808 


0 


-11.171 


-8.233 


-17.307 


0 


0 


0 


-16.528 


0 


CYS 
A 345 


-13.098 


-1 1 .530 


0 


-11.593 


-10.171 


0 


0 


0 


0 


-6.976 


0 


0 


PRO 
A 346 


-13.098 


-11.530 


0 


-11.593 


-12.171 


0 


-13.307 


0 


-8.472 


-6.976 


0 


0 


SER 
A 347 


-13.098 


-1 1 .530 


0 


-11.593 


-12.171 


0 


-13.307 


0 


-8.472 


-6.976 


0 


0 


LEU 

A 348 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


ALA 
A 349 


-16.098 


0 


0 


0 


0 


-7.233 


0 


-3.330 


0 


0 


-13.528 


-14 


GLY 
A 350 


-26.098 


-21.530 


0 


-1 1 .593 


-26.171 


-16.233 


-20.307 


-2.330 


-10.472 


-12.976 


-20.528 


-15 


LYS 
A 351 


-26.098 


-21.530 


0 


-11.593 


-26.171 


0 


-20.307 


-2.330 


-9.472 


-9.976 


-17.528 


-11 


PRO 
A 352 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


-1 1 


LYS 
A 353 


-16.098 


0 


0 


0 


0 


-7.233 


0 


0 


0 


0 


-13.528 


-14 


VAL 
A 354 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


PHE 
A 355 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


PHE 

A 356 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


ILE A 
357 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


GLN 
A 358 


-30.098 


0 


-8.808 


-22.593 


-28.171 


-18.233 


-32.307 


0 


0 


-16.976 


-30.528 


0 


ALA 

A 359 


-30.098 


-25.530 


-8.808 


-7.593 


-28.171 


-18.233 


-25.307 


0 


0 


-16.976 


-22.528 


0 


CYS 
A 360 


-30.098 


-25.530 


-8.808 


-22.593 


-28.171 


-18.233 


-32.307 


-3.330 


0 


-16.976 


-31.528 


0 


GLN 
A 361 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


GLY 
A 362 


0 


0 


-3.808 


0 


0 


0 


-21.307 


0 


0 


-6.976 


0 


0 


ASP 
A 363 


-23.098 


-1 5.530 


-3.808 


-8.593 


-18.171 


-9.233 


-26.307 


-3.330 


-8.472 


-7.976 


-24.528 


0 


ASN 
A 364 


-13.098 


-8.530 


-4.808 


0 


-11.171 


-7.233 


-21.307 


0 


-8.472 


-9.976 


-11.528 


0 


TYR 
A 365 


-13.098 


0 


-3.808 


0 


0 


-7.233 


-14.307 


0 


-7.472 


-6.976 


-11.528 


0 


GLN 
A 366 


-13.098 


0 


0 


0 


0 


-7.233 


-15.307 


-3.330 


-10.472 


-6.976 


-11.528 


0 


LYS 
A 367 


-13.098 


-8.530 


-4.808 


0 


-10.171 


-7.233 


-15.307 


-2.330 


-10.472 


-6.976 


-11.528 


0 


GLY 
A 368 


0 


0 


0 


0 


0 


0 


0 


-3.330 


0 


0 


0 


0 


ILEA 
369 


0 


0 


-4.808 


0 


0 


0 


0 


-3.330 


0 


-6.976 


0 


0 


PRO 
A 370 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


VAL 
A 371 


-1 3.098 


-10.530 


0 


0 


0 


0 


0 


-2.330 


0 


0 


-11.528 


0 


GLU 
A 372 


-13.098 


0 


0 


0 


0 


0 


0 


0 


0 


0 


-11.528 


0 


THR 

A 373 


-13.098 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


ASP 
A 374 


-13.098 


0 


0 


0 


0 


0 


0 


0 


0 


0 


-11.528 


0 


NME 
A 999 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


ACE 
BO 


0 


0 


0 


0 


-10.171 


0 


0 


0 


0 


0 


0 


0 
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acet- 
amide 


acetone 


ben- 
zene 


carbo- 
xylic 
acid 


dimethyl 
sulfoxide 


ethanol 


imidazol 
e 


iso- 
butane 


pyrimi- 
dine 


tetra- 
hydro- 
furan 


urea 


H 2 0 


THR 

B390 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


ARG 
B391 


0 


0 


0 


0 


-11.171 


0 


0 


0 


0 


-6.976 


0 


0 


TYR 
B392 


0 


0 


0 


0 


0 


0 


0 


-2.330 


0 


0 


0 


0 


ILE B 

393 


-1 6.098 


0 


0 


0 


-23.171 


-7.233 


0 


-3.330 


0 


-9.976 


-13.528 


0 


PRO 
B 394 


-16.098 


0 


0 


0 


0 


-7.233 


0 


0 


0 


0 


-13.528 


0 


ASP 
B395 


-16.098 


0 


0 


0 


0 


-8.233 


0 


-2.330 


0 


0 


-13.528 


0 


GLU 
B396 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


ALA 
B397 


-16.098 


0 


0 


0 


0 


-7.233 


0 


0 


0 


0 


-13.528 


-14 


ASP 
B 398 


-16.098 


0 


0 


0 


0 


-7.233 


0 


0 


0 


0 


-13.528 


-14 


PHE 

B 399 


0 


0 


0 


0 


0 


0 


0 


-4.330 


0 


0 


0 


0 


LEU 
B400 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


LEU 
B401 


0 


0 


-3.808 


0 


0 


0 


0 


-3.330 


0 


0 


0 


0 


GLY 
B402 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


MET 
B403 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


ALA 
B404 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


THR 
B405 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


VAL 
B406 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


ASN 
B407 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


ASN 
B408 


0 


0 


0 


0 


0 


0 


-21.307 


0 


0 


0 


0 


0 


CYS 
B409 


-13.098 


0 


0 


0 


0 


0 


0 


0 


0 


0 


-11.528 


0 


VAL 
B410 


0 


0 


0 


0 


0 


0 


0 


-2.330 


0 


0 


0 


0 


SER 
B411 


-30.098 


-25.530 


-8.808 


-22.593 


-28.171 


-18.233 


-32.307 


0 


0 


-16.976 


-31 .528 


0 


TYR 
B412 


-30.098 


0 


-8.808 


0 


0 


0 


0 


-3.330 


0 


-7.976 


0 


0 


ARG 
B413 


-30.098 


-25.530 


-8.808 


-22.593 


-28.171 


-18.233 


-32.307 


-3.330 


-11.472 


-16.976 


-31 .528 


0 


ASN 
B414 


0 


-9.530 


0 


0 


-10.171 


0 


0 


-3.330 


0 


0 


-13.528 


0 


PRO 
B 415 


-14.098 


0 


-8.808 


0 


0 


-9.233 


-15.307 


0 


0 


0 


-12.528 


0 


ALA 
B416 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


GLU 
B417 


0 


-9.530 


0 


0 


-15.171 


0 


0 


0 


-15.472 


-8.976 


-13.528 


0 


GLY 
B418 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


-13.528 


0 


THR 
B419 


0 


0 


-8.808 


0 


-10.171 


0 


0 


0 


0 


0 


0 


0 


TRP 

B 420 


0 


-9.530 


-3.808 


0 


-10.171 


0 


0 


-3.330 


-15.472 


-8.976 


-13.528 


0 


TYR 
B421 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


ILE B 

422 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


GLN 
B423 


-20.098 


-14.530 


-4.808 


0 


-18.171 


0 


-22.307 


-3.330 


-15.472 


-9.976 


-13.528 


0 


SER 
B424 


-21.098 


-19.530 


0 


0 


-18.171 


0 


-22.307 


0 


0 


0 


-20.528 


0 


LEU 
B425 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


CYS 
B426 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


GLN 
B427 


-21.098 


-19.530 


-3.808 


-9.593 


-18.171 


-12.233 


-22.307 


-2.330 


-12.472 


-8.976 


-22.528 


0 


SER 
B 426 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


LEU 
B429 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


ARG 
B430 


0 


0 


0 


0 


0 


0 


0 


-2.330 


0 


0 


0 


0 


GLU 
B431 


-19.098 


-14.530 


0 


0 


-21.171 


-7.233 


-22.307 


0 


-10.472 


-10.976 


-17.528 


-11 


ARG 
B 432 


0 


0 


0 


0 


-18.171 


-7.233 


0 


0 


0 


0 


-17.528 


-11 


CYS 
B433 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 
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acet- 
amide ! 


acetone 


ben- 
zene 


carbo- 
xylic 
acid 


dimethyl 
sulfoxide 


ethanol 


imidazol 
e 


iso- 
butane 


pyrimi- 
dine 


tetra- 
hydro- 
furan 


urea 


H 2 0 


PRO 
B434 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


ARG 
B435 


-19.098 


-14.530 


0 


0 


-21.171 


0 


-22.307 


0 


-10.472 


-10.976 


-17.528 


0 


GLY 
B436 


0 


0 


0 


0 


-12.171 


0 


0 


0 


0 


0 


0 


0 


ASP 
B437 


-19.098 


-11.530 


0 


-20.593 


-13.171 


0 


0 


0 


-12.472 


-6.976 


-14.528 


0 


ASP 
B438 


-19.098 


-11.530 


0 


-20.593 


-13.171 


-14.233 


-14.307 


0 


-12.472 


-6.976 


-14.528 


-15 


ILE B 

439 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


LEU 
B440 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


THR 
B441 


0 


0 


0 


0 


-13.171 


0 


0 


0 


0 


0 


0 


0 


ILE B 
442 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


LEU 
B443 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


THR 
B444 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


GLU 
B445 


0 


0 


0 


0 


0 


-7.233 


0 


0 


0 


0 


-17.528 


-11 


VAL 
B446 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


ASN 
B447 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


TYR 
B448 


-13.098 


0 


0 


0 


0 


-7.233 


0 


-2.330 


0 


0 


-11.528 


0 


GLU 
B449 


-24.098 


-19.530 


0 


-14.593 


-18.171 


-17.233 


-26.307 


0 


-12.472 


-7.976 


-27.528 


-17 


VAL 
B450 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


SER 
B451 


-13.098 


0 


0 


0 


-10.171 


0 


-13.307 


-3.330 


-10.472 


-6.976 


-1 1 .528 


0 


ASN 
B452 


-24.098 


-10.530 


-4.808 


-11.593 


-10.171 


-9.233 


0 


-2.330 


-10.472 


-7.976 


-27.528 


0 


LYS 
B453 


-24.098 


-19.530 


-5.808 


-14.593 


-18.171 


-17.233 


-26.307 


-3.330 


-15.472 


-11.976 


-27.528 


-17 


ASP 
B454 


-16.098 


0 


-4.808 


-12.593 


0 


-7.233 


0 


-2.330 


0 


0 


0 


0 


ASP 
B455 


0 


0 


0 


-7.593 


0 


0 


0 


0 


0 


0 


0 


0 


LYS 
B 456 


-16.098 


-9.530 


0 


-12.593 


-11.171 


-7.233 


0 


0 


0 


-6.976 


0 


0 


LYS 
B457 


0 


0 


0 


-7.593 


-11.171 


-7.233 


0 


0 


0 


0 


0 


0 


ASN 
B458 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


MET 
B459 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


GLY 
B460 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


LYS 
B461 


-13.098 


-10.530 


-4.808 


-10.593 


-10.171 


-7.233 


-13.307 


-4.330 


-10.472 


-6.976 


-11 .528 


0 


GLN 
B462 


0 


0 


0 


o 


0 


0 


0 


0 


0 


0 


0 


0 


MET 
B463 


0 


0 


0 


0 


0 


0 


-13.307 


-3.330 


-10.472 


0 


0 


0 


PRO 
B464 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


GLN 
B465 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


PRO 
B466 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


THR 
B467 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


PHE 
B468 


0 


0 


0 


0 


0 


0 


0 


-2.330 


0 


0 


0 


0 


THR 
B469 


0 


0 


0 


0 


0 


0 


0 


-3.330 


0 


0 


0 


0 


LEU 
B470 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


ARG 
B471 


-20.098 


-14.530 


-3.808 


-8.593 


-23.171 


-7.233 


-23.307 


-2.330 


-1 1 .472 


-12.976 


-16.528 


0 


LYS 
B472 


-26.098 


-21 .530 


-4.808 


-1 1 .593 


-26.171 


-16.233 


-21.307 


-3.330 


-14.472 


-13.976 


-20.528 


-15 


LYS 
B473 


-21.098 


-13.530 


-3.808 


-20.593 


-13.171 


-14.233 


-14.307 


-2.330 


-12.472 


-8.976 


-14.528 


-15 


LEU 
B474 


0 


0 


O 


0 


0 


' o 


0 


0 


0 


0 


0 


0 


VAL 
B475 


0 


0 


O 


-10.593 


0 


0 


0 


0 


0 


0 


-19.528 


0 


PHE 
B476 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


PRO 
B477 


-22.098 


-12.530 


0 


-10.593 


-16.171 


-7.233 


-22.307 


-2.330 


-9.472 


0 


-30.528 


0 
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acet- 
amide 


acetone 


ben- 
zene 


carbo- 
xylic 
acid 


dimethyl 
sulfoxide 


ethanol 


imidazot 
e 


iso- 
butane 


pyn mi- 
dine 


tetra- 
hydro- 
furan 


urea 


H 2 0 


SER 
B478 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


ASP 
CB 
479 


-22.098 


-10.530 


0 


-13.593 


-15.171 


-15.233 


-25.307 


0 


-9.472 


0 


-30.528 


-14 



[0071] While various embodiments of the present invention have been 

described above, it should be understood that they have been presented by way 
of example, and not limitation. It will be apparent to persons skilled in the 
relevant art that various changes in detail can be made therein without 
departing from the spirit and scope of the invention. Thus the present 
invention should not be limited by any of the above-described exemplary 
embodiments. 

[0072] All references and publications referred to herein are hereby 

incorporated by reference in their entirety. 



207526 2.DOC 



